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The Remainder in Linear Methods of Approximation 

By W. E. Milne ' 

The following discussion grows out of an effort to formulate a systematic treatment 
of the error for such diverse processes as interpolation, numerical differentiation, numerical 
integration, harmonic analysis, approximation by Least Squares, approximation by equating 
moments, and other allied operations. 

First of all, a systematic and uniform procedure is exhibited by which a desired method 
of approximation may be explicitly constructed. Second, by suitable transformation the 
remainder is put in a form in which it is possible in many cases to estimate its approximate 
magnitude. And third, tin 1 theory is applied to a variety of concrete examples, and bounds 
are obtained for the magnitude of the error. 



I. Fundamental Problem of Approximation 

In the various types of problems to which this 
investigation applies, we are dealing with a real 
function f(x) in an interval aSx^b, and we are 
supplied with a more or less adequate description, 
or definition, of this function. For example, we 
may have a table of values of /'(./') furnished by 
experimental observations, or a table of values 
of /(./:) computed from a series or an asymptotic 
expansion, or we may have the values, not of /'(./: ) 
itself, but of certain of its derivatives, /'(#), or 
f"(x) etc. In such instances /'(.r) is not completely 
defined by the information provided. On the 
other hand, f(x) may be a particular solution of a 
given differential equation, for which no solution 
in elementary form is known. Or f(x) may be 
the solution of an integral equation or of some 
other type of functional equation. In such cases 
the function f(x) may be completely, though 
implicitly, defined by the given equation. 

The essential point is that we may not have a 
simple mathematical expression for j(x) , and yet 
we wish to obtain particular values of f(x), or 
to find its integral, or to find its derivatives — in 
short to be able to use j(x) just as freely as if it 
were expressed in terms of known elementary 
functions. 

In order to do these things, a common practice 
is to replace /(x) by an approximate expression in 
terms of suitably chosen known functions. How 
to secure this approximate representation of j(x) 



is the problem that we shall call the fundamental 
problem of approximation. 

In order that the subsequent analyses can be 
performed, it will be assumed henceforth that f(x) 
has a continuous derivative of order n | 1 in a fixed 
interval a^x^b. The values of n to be selected 
will appear as we proceed. 

II. Basic Functions 

In all the types of approximation here considered 
the first essential idea is to employ a basic set of 
linearly independent known functions u d (x), 
Ui(x) } . . ., u n (x), and to use an appropriately 
chosen linear combination of these functions as 
an approximation to the given function. So far 
as theory is concerned any other set v (x), th(x), 
. . ., v n (x), equivalent to the set {Ui(x)} by linear 
combination, might equally well be used. In 
practical numerical analysis however, it is impor- 
tant to select the set best suited to the numerical 
process to be employed. 

In numerical methods the set of basic functions 
more often employed than all others put together 
is the set of polynomials, 1, x 9 x 2 , . . ., x n . 
Actually these appear more frequently in various 
lineai' combination such as the binomial coefficient 

functions (a)'(?)> * * " (^Y the Legendre poly- 
nomials P (x), Pi(x), . . ., P n {x), the Lagrangian 
coefficients L { n) (x), L[ n) (x), . . . L ( n n) (x), etc. 
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Next in importance probably are the trigo- 
nometric functions, 1, cos x, sin x, . . . cos rax, 
sin mx which may also occur in power form, 1, 
cos x, sin &,.".., cos w x, sin^x, or in products of 
terms such as sin (x—x^/2. 

Other sets sometimes used are e~ x212 H t (x), 
where H f (x) is an Hermitian polynomial, e~ x L t (x), 
where L t (x) is a Laguerre polynomial, etc. 

It is assumed henceforth that the basic functions 
Uo(x), Ui(x)j . . ., u n (x) all possess continuous 
derivatives of order n-\-\ in the closed interval 
aSx^b, and that the Wronskian determinant 



W r = 



Uq(x) 

u (x) 



Ui(x) 

u[{x) 



u' n {x) 



u (n) (x) u x (n) (x) 



u, 



{n) 



(x) 



does not vanish anywhere in the interval a^x^b. 

III. Equivalence Operators 

The basic set of approximating functions having 
been selected, the next step in solving the funda- 
mental problem of approximation is to set up the 
rule by means of which the coefficients in the 
approximating linear combination of basic func- 
tions are to be determined. We shall consider 
here only objective, analytic methods, as opposed 
to subjective and graphical methods. With this 
restriction, it will be found that in a great many 
cases of practical importance, the rule is based on 
what might be called the principle oj equivalence. 
We shall not attempt to formulate a completely 
general statement of this principle, but shall 
instead bring out its meaning by citing some 
familiar examples in which a principle of equiva- 
lence is employed. Such examples are: 

(a). Equivalence of value at discrete points. 
That is, the value of the approximating function 
is made equal to the value of the given function 
fix) at Xq, X\j . . ., x n . 

(b). Equivalence of lineal elements at discrete 
points. That is, the values of the approximating 
function and its first derivative are made to agree 
with j(x) and f(x) at assigned points. This may 
also be regarded as a limiting case of (a) when 
pairs of points move into coincidence. 



(c) Equivalence of weighted definite integrals. 
That is, the value of 



w<(x) j{x) dx, 

Ja 



are made to agree with corresponding expressions 
formed for the approximating function. 

(d) Equivalence of weighted sums. That is, 
the values of 

M 

S Wifo)/fo)i 

.7=0 

are made to agree with similar expressions for the 
approximating function. 

Evidently this list can be greatly extended. 

We shall henceforth in this paper consider only 
cases in which the coefficients are completely 
determined by some appropriate application of 
the principle of equivalence. 

In order to treat all such cases by means of a 
comprehensive notation, it is convenient to 
introduce a set of operators O , U . . ., 0«, equal 
in number to the set of basic functions, and called 
equivalence operators. What these operators actu- 
ally do in any particular method of approximation 
is determined by the rule selected for setting up 
the approximation. For the examples (a), (b), 
(c), and (d) above the corresponding operators 



are: 



n. 



(a) OJ(x)=f(x i ); i=l, • • -,n. 

(b) 0.i/(aO =/(*<), i=l 

2i+ J(x)=f(x i ); 

(c) OJ(x)= f wi{x)j(x)dx) 

J a 



i = l, . . .,?l. 



(d) O i f(x)=^w ij f(x j ); i=l, 

;'=o 



.. n. 



It is desirable to employ some form of analytical 
expression for the operators t , sufficiently general 
to include all types of linear operators likely to 
arise, but sufficiently restricted to guarantee that 
subsequent mathematical manipulations can be 
carried out, Accordingly, it is assumed hence- 
forth that O t f(x) can be expressed by means of 
Riemann-Stieltjes integrals in the form 

Oij{x) = iz [ b j^^dg^.k^n, 

j=0 J a 
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in which the %ij(x) are suitably chosen functions 
of bounded variation in the interval a^x^b. 
By specialized choices of the q tj we may obtain 
each of the particular operators (a), (b), (c), and 
(d) listed above. For example, if q it (x) is a- step 
function with unit step at x=x t while the other 
q_ij(x) vanish we have 



OJ(x)= f f i Hx)dq it (x)=f t Kx i 

J a 



Again if dq i0 (x)=Wi(x)dx and <fo=0 for j=l, 
2, . . ., k, then 



OJ(x)= Wi{x)j{x)dx. 

J a 



Also if q i(y {x) is a step function having M+l steps 
of magnitude w i0 at x , w n at x u . . ., w m at x M , 
and if 2^=0, for j^>0, then 

From the definition of 0/ as a sum of integrals, 
it is clear that 

(1) The operation f is distributive with respect 
to addition, i. e., 

0t[Mx) +Mz)]mO<f l (z)+O i fi(z), 

(0 1 +0 2 )J(x)^OAx)+0 2 f(x). 

(2) If A is constant 

0JLAf(z)]=A0J(z). 

(3) The result of the operation d on any func- 
tion f(x) no longer contains z as an independent 
variable and insofar as the variable x is concerned 
may be treated as a constant. 

IV. The Approximation Formula 

Suppose that the function f(x) has been given in 
any of the ways suggested in section I, that an 
appropriate basic set {Ui} has been chosen, (hat 
a rule for determining the coefficients based on the 
principle of equivalence has been selected (i. e., 
the equivalence operators h 2 , . . ., n are de- 
rided upon), and that what we desire is a formula 
expressing the result of some given operation 
when performed on f(x), where satisfies the 



conditions imposed on the { in section III. Let 
the desired approximating linear combination of 
the Uj be 

^AjUjix), 
./=o 

in which the ^4's are coefficients to be determined. 
Then the principle of equivalence gives the n-\-l 
equations 

0J(x)=0 i f:A j u j (x). 
j=o 

which by the properties stated at the end of 
section III are the same as 

0J(z) = i2A,0&,fr). (1) 

It is assumed henceforth that the determinant 

O u (x) OiU (x) . . . O n u (x) 

O Ui(z) OiU^x) . . . O n Ui(x) 



D= 



O u n (x) O x u n {;x) 



O n u n (x) 



does not vanish. Clearly the n+1 eq 1 determine 
the coefficients Aj uniquely. 

Our objective is to express the result of the 
operation applied to f(x) by means of the same 
operation, 0, applied to the approximating 
function '2AiU i (x) ) together with a remainder 
term, R, denoting the error of this approximate 
representation. That is 

af(x)-0 2 A i ii i (x)=R. 

When the values of the A t as determined by 
eq 1 are substituted in the expression for R, the 
left-hand member may be written in determinant 
form, and we have the general expression for the 
error term 

Of(x) Oof(x) 0J(x) . . . 0J{x) 

Ou (x) O u (x) 1 u (x) s . . . 0„u (x) 

Oui(x) O Ui(x) OiUiix) . . . O n Ui(x) 



1 
D 



Ou n (x) O u n (x) OiU n (x) . . . O n u n (x) 



R. 



(2) 



Formula 2 provides the explicit means for setting 
up any linear approximation formula when the 
desired operation 0, the basic set Ui(x), and the 
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rule for the coefficients based on the principle of 
equivalence have been selected. 

It is obvious from formula 2 that R vanishes 
whenever f(x) is any linear combination of the 
basic set {u t (x) } . Also, R vanishes if the operator, 
0, is any linear combination of the operators , 
O u . . ., O n . 

V. The Operator R 

Since we have assumed that the operator, 0, is 
expressible by means of Riemann-Stieltjes inte- 
grals just as are the O i} it is clear from formula 2 
that R may be looked upon as the result of an 
operation of this same general type performed on 
the function j(x) and might be called 0'j(x). How- 
ever, we wish to retain the letter R associated with 
the idea of a remainder term, and hence shall use 
the notation 

R=--m.m], 

indicating both that R is a remainder and also 
that R is the result of a linear operator operating 
on the function f(x). 

This operator also obeys the distributive prop- 
erty with respect to addition so that 

Example 2. Lineal element interpolation . 

m j(x ) f( Xo ) 



j. 

D 



1 



1 

Xq 

xl 





1 

2z 



Example 3. Quadrature formula. 



R[fi(x) +Mx)} = R[f 1 {x)]+R[f 2 (x)]. 
Moreover, as we observed at the end of section 



IV, 



R[u t (x)] = 0, i=0, 1, 



n. 



These two properties of R are essential in the sub- 
sequent developments. 

A few examples showing the formation of R[f(x)] 
in specific cases will be illuminating. In each case 
D is the cofactor of the element in the upper left- 
hand corner of the numerator determinant. If 
the values x G , x h . . ., x n are all distinct it may 
be shown that in none of these cases does D 
vanish. 

Example 1. Polynomial interpolation. 

Here the formula is 



f(x) 


/(*>) 


/(*,) 


1 


1 


] 


x 


Xq 


X! 


x 2 


4 


xi 



Xq 



f(x n ) 

1 



f(x n ) 



1 

2x 



x'i" + 1 (2v + l)x 2 n n 



f(x n ) 

1 

x 2 , 



=/?[/(*>!. 



-mm]- 



D 



jj(x)dx 


$ — a 


$ 2 -a 2 


2 


3 -<* 3 


3 


ftn+l a n + l 


n + l 



1 

Xq 
Xq 



M) 

l 

Xl 

x\ 



J\Xn) 
1 



af 



=-»[/(*)]• 
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Example 4. Taylor's series with remainder 



1 
D 



1 

#0 





1 
2x 



X n Xq W'Xo 

Example 5. Least square approximation. 

Since any linearly independent set of functions 
can be made orthonormal by linear combination, 
the basic set itself may be assumed to be ortho- 
normal with respect to a given interval a^x^b. 
Then with the abridged notation 

\f u=s f(x)u(x)dx, etc., 
the formula is 

/(*) ffrh ffih ffUn 

u Q (x) 

jzUtix) 



u n {x) 



I) 



=R\f(x)]. 



A Fourier series with remainder is a particular 
case of this example 4 . So also is Least Square ap- 
proximation using Legendre Polynomials. 

The procedure outlined above evidently lias 
great flexibility, since theoretically at least there 
arc infinitely many ways of choosing a basic set 
of functions, infinitely many ways of selecting the 
rule for forming the coefficients based on the 
equivalence principle, and similarly for f(x) and 
the operator 0. 

It should be emphasized that although eq 2 has 
been called "an approximation formula", it is not 
necessarily in any sense a "good" approximation. 
Only when the remainder term P[f(x)] is small 
enough to be ignored without affecting the desired 
accuracy do we call it a good approximation. 

VI. The Basic Differential Equation 

Since the functions Ui(x) have continuous 
derivatives of order n + 1 and the Wronskian, W, 



-R[j{x)]. 







n\ 



does not vanish in the interval a^x^b, wc may 
construct a homogeneous linear differential equa- 
tion 

L(u)=D n+l n+P l (x)D n n + 
P 2 (x)D n - 1 u+ . . . +P n+1 {x)u=0 

of order n+\ with coefficients Pi(x) continuous in 
a^x^b, which has the n-\-l functions //,(./•) as a 
set of linearly independent solutions. 

I n determinant form t his differential equation is 



1 
W 



(n + l) 




u (7 



<" J) 



u 

u 
u x 



= 



Example 1. For the basic set of polynomials 
I, ./■. . . ., x n y the differentia] equation is simply 

L(u)=n n + l u = (). 

Example 2. For the basic trigonometric func- 
tions, 1, cos x, sin a:, . . ., cos mx, sin mx, the 
equation is 

D(D 2 +l)(D 2 + 2 2 ) . . . (D 2 +m 2 )u=0. 

Example 3. For the basic hyperbolic set 1, 
cosh x, sinh x, . . ., cosh mx, sinh mx, the 
equation is 

]){I)--]){D 2 -2 2 ) . . . (D 2 -m 2 )u=0. 

Example 4. For the basic exponential set 
e x , e 2x , . . ., e mx the equation is 

(D-\)(D-2) . . . (D-m)u=0. 
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It is of interest to note that in each of these four 
examples the differential equation has constant 
coefficients. In general, of course, the coefficients 
are functions of x. It may also be noted that in 
the first three examples above the differential 
equation is self-adjoint. 

The general solution of the nonhomogeneous 
differential equation 



is 



L(u)=(t>(x), 

u=it,C<u i (x)+ cf>(s)g(x,s)ds, (3) 

i=0 J a 

in which the C 1 " s are arbitrary constants and 
u (x) Ux(x) . . . u n (x) 



g(x,s)-- 



n.(n-l) 
LL 



(s) <-"(«) 



u (s) u[(s) 

u (s) u^s) 






r x) (s) 



u' (s) 

Uq(s) 



u[(s) 



u' n (s) 
u n (s) 



u n (s) 



(4) 



The essential property of g(x,s) is that 



iffto*) = 



■{! 



at x = s for k=~-0, 1, 
at x=s for k = n. 



71-1 



dx kyx y (1 at x=s for k = n. (5) 

VII. The Function g{x, s) 

For the basic set 1, x, x 2 , . . ., x n we find that 

g(x, s) = (x—s) n /n\. 

For the basic set 1, cos x, sin x, . . ., cos mx, 
sin mx, 

fif (x, S )=[2sin(^)] 2n / /(2n)!. 

For the set 1, cosh x, sinh x, . . ., cosh mx, 
sinh mx, 

g(x, s) =[2 sinh (^)] 2n /(2n)l 



For the functions e x2/2 H n (x), 

g(x, s)=e is2 ~ x2)/2 (x—s) n ln\. 

For the functions e~ x L n (x), 

g(x, s)=e s ~ x (x—s) n /n\. 

For the set e x , e 2x , . . ., e (n+1) x, 

g(x, s) = (e x ~ s -l) n ln\. 

VIII. Expression for the Remainder 

Let f(x) be a function with a continuous deriva- 
tive of order n -f 1 in the region under consideration 
and set up the nonhomogeneous differential 
equation 

L { u)=L(j(x)). 

A particular solution of this equation is obviously 
u=f(x), and hence by eq 3 

j{x) = J^a i u i {x) + P L(j(s))g{x,s)ds. (6) 

J a 

Suppose now that we have some particular oper- 
ator, R, where R is formed for the same basic set 
as is eq 6. Applying the operator, R, to eq 6 we get, 
in view of distributive property of R, 

Rim^^aMuiix^+R [JJ L(J(s))g(x,s)ds^ 

Since R[Ui(x)] = 0, i=0, 1, . . ., w, the first term 
on the right drops out. Now R applied to any 
function <p(x) has the form 

fifo(s)] = S f ^ i) {x)dg i {x), k<n, 

where the ^{x) are of bounded variation in 

a^xSb. 

Consequently 

b\_£ L(J(s))g(x,s)d s y= 

§T £ [.f L ^ s ^(x,s)ds'j dq t (x). 

By successive differentiations of the integral and 
use of eq (5) at each step we obtain 



dx 



l - t [ £ L(J( S ))g(x, .)]= = £ L(f(s)) ^ g(x, s)ds; 
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hence 



R^£ L(f(s))g(x, s)ds'j 

k rb rx 5< 

-=SJ a I L)f(s))^ g(x ) s)dsdq i (x). 

It is convenient to define g(x, s) by the conditions 
(g(x, s) for x^s 



g(x, s) = 







for x<s. 



(7) 



Then the functions Wgix, s)/dx\ i=0, 1, . . ., n— 1, 
are continuous in the square a^xt^b, a^s^b, 
and vanish identically in that half of the square 4 
for which x<s. Hence in the double integral 
above we may replace g by g, replace the variable 
upper limit x by the constant limit b, interchange 
the order of the integrations, and put the summa- 
tion under the first integral. The result is 

/?[£ L(j(s))g(x,s)ds^=^ L(f(s))6(s)ds, 

where we have set 

k rb fti 

=B\g(x, 8)]. 
We have therefore 



R[f(x)]= P L(J{s))G(s)ds, (8) 

Ja 



in which 



G{s)=R\g(;x,s)]. 



(9) 



Note that R operates on g(x, s) as a function of x, 
not s. Equation 8 provides the desired expression 
for the remainder term. It should be noted that 
eq 8 is merely a mathematical identity and there- 
fore in one sense tells us nothing new. Actually, 
however, eq 8 accomplishes the important step 
of separating the problem into two parts, one of 
which, L(f(s)), is independent of the operator R, 
and the other, 6(s) , independent of the f unction/(:r) . 

IX. The Function G(s) 

For brevity the function G(s) has been written 
as a function of the single variable s, since s is 
the variable of immediate concern in the integral 
of eq 8. In reality G(s) depends, directly or in- 
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directly, also on all the values of x, fixed or vari- 
able, that are implied in the performance of the 
operation R. When s is larger than the largest x 
involved in the operation R[f(x)], the function 
G(s) vanishes identically because for all such values 
of s the function g(x,s) is zero by definition. Again 
if s is less than the least x involved in R[j(x)\ 
the function G(s) vanishes identically, this time 
because g(x,s)=g(x,s) and is therefore simply a 
linear combination of the Ui(x), so that 

R[g{x,s)]=0. 

It follows that the limits a, b, of the integral in 
eq 8, may as well be replaced by -co, + <», 
respectively. 

The behavior of G(s) in the interval where it 
does not vanish can best be illustrated by a 
number of simple examples. 

Example 1. Let the basic set be 1, x, x 2 , # 3 , 
let x , Xi, x 2 , x 3 , # 4 be five equally spaced values of x 
with interval h, let yi=j{Xi) and let 

^[/W]=^-47/ 3 +6?/ 2 -47/ 1 +2/o = AV 
For this case 

g(x,s) = (x-sy/3\,x>s, 



whence 



G(s)=R[g{x,s)] = y[(xi-s)-±(x 3 -sy+ 



6(32— s) —4(xi — s) +(x —s) ], 
where the notation (x-sf means that 

(x— s) z if x^s, 
ifx<s. 



{x—s) 



We now see at once that for 
#4S^<°° G[s)=0 



X Z S<<C%4 

x 1 s<<Cx 2 
— °°<sOo 



<7(s)=^(* 4 -s) 3 



£(*)=£! [(x,-sf-i(x,-sf] 
G(s)= -~ [-4(x 1 -s) 3 +(x -s) 3 ] 



G(s)=-±[(x Q -sy] 



G(s)=0. 
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The graph of this function is shown in figure 1. 




Figure 1. 

Thus it is seen that G(s) consists of four arcs of 
four different third-degree polynomial curves 
joined together continuously and with continuous 
first and second derivatives. 

Since we started with R[f(x) ] = A 4 i/ , we have 
shown that 



A i yo=£f i \s)G( s )ds, 



when G(s) is the function just described. 

Example 2. Let the basic set be 1, x, x 2 , x 3 , x*, 
let Xq, x u x 2 be three equally spaced points with 
interval h, and let 

R[j(x)]=y 2 -y -~ (y'2+fy'i+yo)- 

(This is equivalent to Simpson's Rule) . 
For this case 



whence 



0(8) 



g(x,s) = (z-s) /4! 
fa— s) 4 — fa— s) 4 



4! 



"3X3! f^ 2— s ) +4fa— s) +(x Q — s) ]. 
We have then: 

0(a) = 



X 2 ^zS<^co 



G(s) 



X ^S<^Xi G(s) = 



= (x 2 —sY h(x 2 —s) z 
4! 3X3! 

fa— s) 4 , h(x — s) z 



— °° OOo 



4! 
G(s) = 0. 



3X3! 



The graph of this function is shown in figure 2. 
Here G(s) consists of arcs of two fourth-degree 
polynomials joined together with third-degree 
contact. 

Recalling the original definition of R[j{x)] in 
this example we see that 
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h r°° 

2/2—2/0—3 (y'2+±y[+y'o)= \_j i5) (s)G(s)ds. 

The right-hand member of this equation is there- 
fore a measure of the error of Simpson's rule in the 
special form chosen here. 




Figure 2. 

Example 3. In the two preceding examples we 
have started with a known operator, R, and have 
derived from R the function G(s). Now that we 
see something of the character of G(s) in the case 
of a polynomial basic set, it is clear that we could 
reverse the process — i. e., start with a G(s) 
satisfying suitable conditions as to degree and 
continuity and derive from it the operator R. 
To illustrate this procedure we start with 



67(s)=0 


if Xi^s, 


6(s) = {x 1 -s)\s-x y 


if x ^s<Cxi, 


C7(s)=0 


if s<^Xo. 



Here G(s) is a polynomial of degree six, so that 
the basic set is 1 , x, x 2 , x z , x*, x 5 , x 6 , the differential 
equation is D 7 u=0, and 

R[M]= j ' m _jv ( S )i^7rj^x7) s d S 

= f m (s)(x 1 -sy(s-x^yds. 

JXl 

Now G{s) = G'(s)—G"(s)=0 at s=x and at s=z x . 



G'"(z 1 ) = -6h > 

G«\x l ) = -72h 2 

(?«>(»,) = -720. 



G'"(xo)=Qh 3 
G w {xo) = -72h 2 
£ (5) (z ) = 360/t 
G m (x ) = -720 
Integration by parts gives accordingly 
RV(x)]=6hW"(x l )+f"(x )] 

-72hy"(x 1 )-f"(xo)]+smh[f(x 1 )+f(x )] 

-720[J{x 1 )-f{x )}. 
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Transposing and dividing by 720 we obtain the 
formula with remainder term 

%.) -A*o)=f Lf(*0 +f(xo)]-~ [/"(*,) -/"(*«)]+ 

i ^[/'"fe)+/"' W] - t4 JV' (*) txt-sns-XoVds. 

This formula might be looked upon as the first 
few terms with remainder of a sort of two-point 
Taylor expansion. It can be used in successive 
approximations to secure a starting value in the 
step-by-step integration of a differential equation. 
Its virtue lies in the fact that while it requires only 
the first three derivatives it is as accurate as a 
Taylor's series using the first six derivatives. 

This example illustrates that fact that G(s) is a 
kind of generating function from which the parti- 
cular operator R associated with G{s) can be 
obtained by carrying out the integration 



/: 



f n+}) (s)G(s)ds. 



It is this properly which suggested the notation 

G(s). 

Example 4. In certain instances it may be 
easier to determine the contact properties by 
which the arcs composing the function G(s) are 
joined and to construct G(s) from these data 
rather than to compute R[(x— s) n ] directly. An 
instance of this is the following: Let the basic 
set be 1, x, . . . , x n , and let R[f(x)] represent the 
difference between j{x) and the best n-{\\ degree 
polynomial approximation by Least Squares for 
the interval ( — 1, 1). We shall suppose that x 
is in this interval. A brief examination will show 
that G(s) is of degree n in x, of degree 277 + 1 in s, 
vanishes at s = l to the same order as (1— s) n+1 , 
vanishes at s= — 1 to the same order as (l+s) n+1 , 
and if G R and G L are the polynomials to the right 
and left respectively of s=x, then 

n \ x s ) i n 

***" n! + *' 

These data determine G R and G L completely and 
after considerable algebraic reduction it is found 
that 

^)= 2 ^;[g^|(x+i)»(,-iyj(i+#} 

for x<s<l. 



The foregoing examples furnish an idea of the 
character of (Gs) when the basic set consists of 
polynomials. Similarly when the basic set is 
trigonometric we would expect to find that G(s) is 
composed of trigonometric arcs welded together; 
for the exponential case, G(s) would be made up of 
exponential arcs, etc. 

X. Computation of the Error 

As has been shown, the remainder is of the form 

R[f(x)]=j~ a L(f( S ))6{s)ds. 

If a^s^b is the interval in which G(s) is not 
identically zero, we have 



\R[f(x)]\<MK(b-a), CIO) 



in which 



M=max| £(/(>)) | in a<s<6, 

Z£=max|<7(>)|in a<s<b. 

The inequality 10 provides a bound for the mag- 
nitude of the error in all cases. 

In many particular problems this result can be 
considerably improved. For example, if G(s) does 
not change sign in the interval (a,b) we may apply 
the law of the mean for integrals and write 

R[M]=L(M)J° w G(s)ds, (11) 

where a<^<6. Note that this is the case for 
examples 1, 2, and 3 of section IX. Since in 
those examples we have the explicit expressions 
for the several arcs of which G(s) is composed, 
there is no difficulty in performing the integration 
over each arc and thus we can obtain the actual 
value of the integral 



r. 



G(s)ds. 



However we now show that the value of the above 
integral may be obtained from the operator R 
without making any use of the explicit form of 
G(s). Let w(x) be any solution of the differential 
equation 

L(w) = l, 

and replace j(x) in eq 8 by w(x) . The result is 
R[w(x)]= P G{s)ds. 

,7—00 
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By using this result in eq 11 we have 
B[f(z)]=L(f<£))B[v>(.x)], 
provided G(s) does not change sign. 

XI. Polynomial Approximation 
For the polynomial case where 



(12) 



L{u)- 



d n+1 u 



=0, 



'dx n+r 

a particular solution of L(w) = l is 

w(x)=x n+1 /(n+l)L 

Hence we have the theorem : 

if, (a) the basic set is 1, x, . . ., x n , and 

(b) the function G(s) does not change sign, 
then the error is 



/(n+l) (t) 



(13) 



where a<£<6. 



This result contains many well-known expressions 
for the error as special cases. For example, in 
the case of polynomial interpolation eq 13 re- 
duces to 

R U( X )]=* r n+l \ \ (x—x )(x—x 1 ) . . . (x—x n ). 

In the type of interpolation where both f(x) and 
f'{x) are given by the approximating polynomial 
at the given points, eq 13 becomes 

/(2»+2) (t\ 

R VW- (2n+2)\ ( x - x o) 2 (x-*i) 2 • • • (^-^) 2 - 

In the case of Newton-Cotes quadrature formulas, 
eq 13 also gives the same results as those obtained 
by the use of Bernoulli polynomials. 

Example 1. In the case of Newton's "three 
eighth's rule" in the form 

Q 7 

2/a — yo — g- (y's+ty2+3yi+yo)=ll(y), 

the basic set is 1, #, x 2 , x 3 , 2 4 , and the points a.re 

%Q) Xq~\~ h, Xq~y2iiI) Xq -\~ oil. 

There is no loss in generality if we take x rj = 0< 
since the formula is independent of any particular 
location on the z-axis. Then 



i?[x 5 ]=J3 5 -^(3 4 +3X2 4 +3Xl 4 +0 4 )U 5 =-9/i 5 /2, 

and the error turns out to be 

y (5) gh 5 _3hY 5) 



\R[y]\='- 



5! 2 



80 



Here, and elsewhere below, the argument of the 
derivative in the remainder has been suppressed. 

The theorem also applies to a wide variety of 
approximate formulas that are not commonly 
treated in texts on interpolation and numerical 
integration, and to which the customary methods 
for obtaining the error do not directly apply. 

A few such formulas, together with error terms, 
are listed below. 

h h 2 h 5 v {5) 

(i) y n -~y n -i=2 (y f n+yn-i)-j2 (^'-^-O+^p 

h h 2 

(2) y n — y n -i=2 (^+2/I-i)-jq (y"—y"-i) 

4-*_ ( >"- »> \ h7 y {7) 

-r ]20 [Vn V*-i) 100800 ' 

h hhi" h b n^ 

(3)2/„ +1 =2/»+^(72/: + . + 16y: + 2/I-i)+^| !L -y|o- 
(4) y n +2=y n +Y (5^+i— y' n — y'n-i)— 2h?y'J 



+ : 



45 



(5) yn+i=y n -i+ 15 (.7y' K+l +Uy' n +7y' n -i) 

h 2 ,, ,, h 7 y a) 
"16 ^»+'-2/»'->)+4725' 

Q 7 

(6) y„+i— 2y„+y n _ 1 = g - (yl+i—yl-i) 

-24<V„ + i-8B.+y.Ii) +6oi80- 



(7) »fi=y-i+iog (412/1+1+ 128 2 /:+4l2/:_ 1 ) 
9h 2 ft 3 

^'v /" 1 1 \ I 'V / III i - n tit i III \ 

-35 (2/»+i— yn-O+gjg (y»+i+i6y n +1/.-1) 

h n y m 



130,977,000 



These formulas have applications in the numeri- 
cal solution of differential equations. 
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XII. Future Investigations 

So far, the possibilities of securing usable re- 
mainder terms by the methods of this paper have 
not been fully explored. It is, for example, un- 
known whether these methods will give practically 
useful forms of the remainder for the case of 
polynomial approximation by Least Squares, for 
the case of Fourier series, and more general 
Fourier type expansions. These cases have, of 
course, been extensively studied by Dunham 
Jackson and others by using a very different line 
of attack. 

Another matter deserving attention is the dis- 



covery of general theorems by which it can be 
determined for whole classes of operators whether 
or not the associated function G(s) changes sign. 
For we have seen that when we know that 0(s) 
does not change sign the evaluation of the magni- 
tude of the error is considerably simplified. On 
the other hand, if it is necessary to compute G(s) 
in order to determine whether or not G(s) changes 
sign, the benefit of this simplification is consider- 
ably reduced. Except in a few special cases such 
as polynomial interpolation, no such general 
theorems are known. 

Los Angeles, February 25, 1949. 
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